Overview

Dataset statistics

Number of variables: 76
Number of observations: 100000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 58.0 MiB
Average record size in memory: 608.0 B
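
The memory figures can be cross-checked with simple arithmetic. The sketch below assumes that every variable is stored as an 8-byte value and that the frame carries a 128-byte index; neither is stated in the report, but both are consistent with the 781.4 KiB "Memory size" listed for each variable.

```python
# Assumptions (not stated in the report): 8 bytes per stored value
# and a 128-byte index attached to the frame and to each column.
N_ROWS = 100_000
N_COLS = 76
BYTES_PER_VALUE = 8
INDEX_BYTES = 128

# Per-variable "Memory size" and whole-frame size, in bytes.
column_bytes = N_ROWS * BYTES_PER_VALUE + INDEX_BYTES
total_bytes = N_ROWS * N_COLS * BYTES_PER_VALUE + INDEX_BYTES

column_kib = column_bytes / 1024
total_mib = total_bytes / 1024**2
record_bytes = total_bytes / N_ROWS

print(f"{column_kib:.1f} KiB per variable")
print(f"{total_mib:.1f} MiB in total")
print(f"{record_bytes:.1f} B per record")
```

Under these assumptions the three printed values round to the figures reported above.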

Variable types

BOOL: 50
NUM: 26

Reproduction

Analysis started: 2022-02-06 14:52:31.853407
Analysis finished: 2022-02-06 15:02:34.652079
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
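
The run time of the analysis follows from the two timestamps. A minimal sketch using only the standard library:

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started/finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2022-02-06 14:52:31.853407", FMT)
finished = datetime.strptime("2022-02-06 15:02:34.652079", FMT)

elapsed = finished - started
elapsed_seconds = elapsed.total_seconds()

print(elapsed)           # timedelta, roughly ten minutes
print(elapsed_seconds)   # same duration in seconds
```

The profile of 100000 rows by 76 variables therefore took a little over ten minutes to compute.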

Warnings

High Correlation

initial_list_status2 is highly correlated with initial_list_status1
initial_list_status1 is highly correlated with initial_list_status2
funded_amnt_inv is highly correlated with funded_amnt and 1 other fields
funded_amnt is highly correlated with funded_amnt_inv and 1 other fields
installment is highly correlated with funded_amnt and 1 other fields
out_prncp_inv is highly correlated with out_prncp
out_prncp is highly correlated with out_prncp_inv
fico_range_high is highly correlated with fico_range_low
fico_range_low is highly correlated with fico_range_high

Skewed

annual_inc is highly skewed (γ1 = 50.78562969)
tot_coll_amt is highly skewed (γ1 = 32.05463979)
delinq_amnt is highly skewed (γ1 = 64.48766706)
tax_liens is highly skewed (γ1 = 35.83709048)
out_prncp is highly skewed (γ1 = 85.44453918)
out_prncp_inv is highly skewed (γ1 = 85.45945131)

Zeros

delinq_2yrs has 79483 (79.5%) zeros
inq_last_6mths has 56711 (56.7%) zeros
pub_rec has 81999 (82.0%) zeros
collections_12_mths_ex_med has 98290 (98.3%) zeros
acc_now_delinq has 99459 (99.5%) zeros
tot_coll_amt has 84157 (84.2%) zeros
chargeoff_within_12_mths has 99129 (99.1%) zeros
delinq_amnt has 99564 (99.6%) zeros
tax_liens has 96122 (96.1%) zeros
total_rec_late_fee has 93433 (93.4%) zeros
out_prncp has 99973 (> 99.9%) zeros
out_prncp_inv has 99973 (> 99.9%) zeros
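
The Skewed and Zeros flags behave like simple threshold rules. The sketch below is an illustration only: the cut-offs (skewness above 20, more than 10% zeros) are assumptions that happen to reproduce the flags listed above, not values read from the report or its configuration. The skewness and zero counts are copied from the per-variable sections.

```python
# Hypothetical cut-offs: the report does not state them. They are chosen
# to be consistent with which variables are and are not flagged.
SKEWNESS_THRESHOLD = 20.0
ZEROS_SHARE_THRESHOLD = 0.10
N_OBSERVATIONS = 100_000

# variable -> (skewness, zero count), copied from this report.
stats = {
    "annual_inc": (50.78562969, 0),
    "dti": (0.2131181916, 31),
    "pub_rec": (14.14625367, 81_999),
    "acc_now_delinq": (19.79346672, 99_459),
    "tax_liens": (35.83709048, 96_122),
}


def flags(skewness, zeros):
    """Return the warning tags a variable would receive under the assumed rules."""
    tags = []
    if abs(skewness) > SKEWNESS_THRESHOLD:
        tags.append("Skewed")
    if zeros / N_OBSERVATIONS > ZEROS_SHARE_THRESHOLD:
        tags.append("Zeros")
    return tags


warnings_by_variable = {name: flags(*values) for name, values in stats.items()}
for name, tags in warnings_by_variable.items():
    print(name, tags)
```

Note that acc_now_delinq (γ1 = 19.79) stays just under the assumed skewness cut-off, which matches its section carrying only the ZEROS tag.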

Variables

int_rate
Real number (ℝ≥0)

Distinct count: 256
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.130833478
Minimum: 0.0532
Maximum: 0.3099
Zeros: 0
Zeros (%): 0.0%
Memory size: 781.4 KiB

Quantile statistics

Minimum: 0.0532
5-th percentile: 0.0649
Q1: 0.0975
median: 0.1274
Q3: 0.158
95-th percentile: 0.2149
Maximum: 0.3099
Range: 0.2567
Interquartile range (IQR): 0.0605
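
Range and IQR are derived from the other quantile statistics: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A quick check with the int_rate figures:

```python
import math

# Quantile statistics reported for int_rate.
minimum, q1, q3, maximum = 0.0532, 0.0975, 0.158, 0.3099

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range

print(round(value_range, 4), round(iqr, 4))
```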

Descriptive statistics

Standard deviation: 0.04477309719
Coefficient of variation (CV): 0.3422143772
Kurtosis: 0.04189338956
Mean: 0.130833478
Median Absolute Deviation (MAD): 0.03564157068
Skewness: 0.537512902
Sum: 13083.3478
Variance: 0.002004630232
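
Several of the descriptive statistics are functions of one another: CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below recomputes them from the reported int_rate values; small differences are expected because the inputs are rounded.

```python
import math

n_observations = 100_000
mean = 0.130833478
std = 0.04477309719

cv = std / mean               # Coefficient of variation
variance = std ** 2           # Variance
total = mean * n_observations # Sum

print(cv, variance, total)
```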
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0532 0.05625 0.06015 0.06135 0.06315 ... 0.2667 0.2868 0.28785 0.30615 0.3099 ], "bayesian blocks" binning strategy used)
Common values

Value               Count   Frequency (%)
0.1099              3149    3.1%
0.0532              2450    2.5%
0.1299              2395    2.4%
0.1399              2362    2.4%
0.1199              2223    2.2%
0.1149              2170    2.2%
0.1699              2134    2.1%
0.0917              2091    2.1%
0.0789              2008    2.0%
0.1561              1985    2.0%
Other values (246)  77033   77.0%
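
The "Other values" row is the remainder after the ten most frequent values are listed. A small check, using the int_rate counts from the table above:

```python
# Counts of the ten most common int_rate values.
top_ten = [3149, 2450, 2395, 2362, 2223, 2170, 2134, 2091, 2008, 1985]
n_observations = 100_000
distinct_count = 256

other_count = n_observations - sum(top_ten)      # rows not in the top ten
other_distinct = distinct_count - len(top_ten)   # distinct values not shown
other_share = other_count / n_observations

print(other_distinct, other_count, f"{other_share:.1%}")
```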
 
Minimum 5 values

Value    Count   Frequency (%)
0.0532   2450    2.5%
0.0593   137     0.1%
0.06     22      < 0.1%
0.0603   702     0.7%
0.0624   566     0.6%
 
Maximum 5 values

Value    Count   Frequency (%)
0.3099   3       < 0.1%
0.3094   3       < 0.1%
0.3089   1       < 0.1%
0.3084   4       < 0.1%
0.3079   3       < 0.1%
 

annual_inc
Real number (ℝ≥0)

SKEWED
Distinct count8905
Unique (%)8.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74360.61148
Minimum5360
Maximum8300000
Zeros0
Zeros (%)0.0%
Memory size781.4 KiB

Quantile statistics

Minimum5360
5-th percentile27000
Q145000
median62000
Q390000
95-th percentile150000
Maximum8300000
Range8294640
Interquartile range (IQR)45000

Descriptive statistics

Standard deviation74674.08925
Coefficient of variation (CV)1.004215643
Kurtosis4901.801967
Mean74360.61148
Median Absolute Deviation (MAD)32503.59913
Skewness50.78562969
Sum7436061148
Variance5576219605
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[5.3600000e+03 7.8700000e+03 9.9380000e+03 1.0000500e+04 1.0984000e+04 ... 5.0200000e+05 7.5500000e+05 1.2270000e+06 1.6220005e+06 8.3000000e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
60000 3978 4.0%
 
50000 3526 3.5%
 
65000 2859 2.9%
 
40000 2851 2.9%
 
70000 2665 2.7%
 
80000 2527 2.5%
 
45000 2523 2.5%
 
75000 2492 2.5%
 
55000 2425 2.4%
 
100000 1985 2.0%
 
Other values (8895) 72169 72.2%
 
ValueCountFrequency (%) 
5360 1 < 0.1%
 
6000 1 < 0.1%
 
7000 3 < 0.1%
 
7111 1 < 0.1%
 
7200 1 < 0.1%
 
ValueCountFrequency (%) 
8300000 1 < 0.1%
 
8253000 1 < 0.1%
 
8121180 1 < 0.1%
 
5604824 1 < 0.1%
 
4800000 1 < 0.1%
 

dti
Real number (ℝ≥0)

Distinct count4148
Unique (%)4.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.5145081
Minimum0
Maximum49.93
Zeros31
Zeros (%)< 0.1%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile5.34
Q112.2
median18.06
Q324.53
95-th percentile33.0105
Maximum49.93
Range49.93
Interquartile range (IQR)12.33

Descriptive statistics

Standard deviation8.413049281
Coefficient of variation (CV)0.4544030679
Kurtosis-0.5291402237
Mean18.5145081
Median Absolute Deviation (MAD)6.919389621
Skewness0.2131181916
Sum1851450.81
Variance70.7793982
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 1.0000e-02 2.3500e-01 8.5500e-01 1.9350e+00 ... 3.4995e+01 3.8385e+01 3.9985e+01 4.6510e+01 4.9930e+01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
15.6 84 0.1%
 
14.4 80 0.1%
 
19.2 79 0.1%
 
13.2 72 0.1%
 
20.4 72 0.1%
 
18.72 69 0.1%
 
14.22 68 0.1%
 
10.8 67 0.1%
 
16.8 67 0.1%
 
18 66 0.1%
 
Other values (4138) 99276 99.3%
 
ValueCountFrequency (%) 
0 31 < 0.1%
 
0.02 2 < 0.1%
 
0.03 2 < 0.1%
 
0.05 1 < 0.1%
 
0.06 3 < 0.1%
 
ValueCountFrequency (%) 
49.93 1 < 0.1%
 
49.86 1 < 0.1%
 
49.61 1 < 0.1%
 
49.59 2 < 0.1%
 
49.57 1 < 0.1%
 

delinq_2yrs
Real number (ℝ≥0)

ZEROS
Distinct count20
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3437
Minimum0
Maximum20
Zeros79483
Zeros (%)79.5%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum20
Range20
Interquartile range (IQR)0

Descriptive statistics

Standard deviation: 0.9050074587
Coefficient of variation (CV): 2.633131972
Kurtosis: 42.31549297
Mean: 0.3437
Median Absolute Deviation (MAD): 0.546366142
Skewness: 4.985413247
Sum: 34370
Variance: 0.8190385004
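
The figure labelled "Median Absolute Deviation (MAD)" appears to be the mean absolute deviation around the mean. With a median of 0 and 79.5% zeros, the median of |x - median| for delinq_2yrs would be 0, not 0.546. The sketch below reproduces the reported value from the mean absolute deviation instead. It relies on one observation: the only values below the mean (0.3437) are zeros, so the deviations below the mean sum to zeros × mean, and the deviations above the mean must sum to the same amount.

```python
import math

n_observations = 100_000
zeros = 79_483
mean = 0.3437
reported_mad = 0.546366142

# Deviations below and above the mean cancel, so the total absolute
# deviation is twice the below-mean part, which here comes only from zeros.
mean_abs_dev = 2 * zeros * mean / n_observations

# More than half of the observations equal the median (0), so the true
# median absolute deviation is 0.
median_abs_dev = 0.0 if zeros > n_observations / 2 else None

print(mean_abs_dev, median_abs_dev)
```

The same reading should apply to the MAD rows of the other variables in this report, although only this one is checked here.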
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 7.5 9.5 12.5 14.5 20. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 79483 79.5%
 
1 13290 13.3%
 
2 4163 4.2%
 
3 1557 1.6%
 
4 711 0.7%
 
5 328 0.3%
 
6 189 0.2%
 
7 111 0.1%
 
8 55 0.1%
 
9 37 < 0.1%
 
Other values (10) 76 0.1%
 
ValueCountFrequency (%) 
0 79483 79.5%
 
1 13290 13.3%
 
2 4163 4.2%
 
3 1557 1.6%
 
4 711 0.7%
 
ValueCountFrequency (%) 
20 1 < 0.1%
 
19 1 < 0.1%
 
18 3 < 0.1%
 
16 2 < 0.1%
 
15 2 < 0.1%
 

inq_last_6mths
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.66888
Minimum0
Maximum6
Zeros56711
Zeros (%)56.7%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9520444367
Coefficient of variation (CV)1.423341162
Kurtosis3.186907213
Mean0.66888
Median Absolute Deviation (MAD)0.7586570736
Skewness1.689340023
Sum66888
Variance0.9063886095
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 56711 56.7%
 
1 27487 27.5%
 
2 10213 10.2%
 
3 3955 4.0%
 
4 1127 1.1%
 
5 440 0.4%
 
6 67 0.1%
 
ValueCountFrequency (%) 
0 56711 56.7%
 
1 27487 27.5%
 
2 10213 10.2%
 
3 3955 4.0%
 
4 1127 1.1%
 
ValueCountFrequency (%) 
6 67 0.1%
 
5 440 0.4%
 
4 1127 1.1%
 
3 3955 4.0%
 
2 10213 10.2%
 

pub_rec
Real number (ℝ≥0)

ZEROS
Distinct count22
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.23572
Minimum0
Maximum63
Zeros81999
Zeros (%)82.0%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum63
Range63
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6614684097
Coefficient of variation (CV)2.806161589
Kurtosis899.892579
Mean0.23572
Median Absolute Deviation (MAD)0.3865760856
Skewness14.14625367
Sum23572
Variance0.437540457
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 6.5 8.5 14.5 23.5 63. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 81999 82.0%
 
1 14722 14.7%
 
2 2143 2.1%
 
3 646 0.6%
 
4 250 0.2%
 
5 119 0.1%
 
6 55 0.1%
 
7 23 < 0.1%
 
8 13 < 0.1%
 
10 10 < 0.1%
 
Other values (12) 20 < 0.1%
 
ValueCountFrequency (%) 
0 81999 82.0%
 
1 14722 14.7%
 
2 2143 2.1%
 
3 646 0.6%
 
4 250 0.2%
 
ValueCountFrequency (%) 
63 1 < 0.1%
 
25 1 < 0.1%
 
22 1 < 0.1%
 
21 1 < 0.1%
 
20 1 < 0.1%
 

revol_bal
Real number (ℝ≥0)

Distinct count36002
Unique (%)36.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16090.20282
Minimum0
Maximum971736
Zeros265
Zeros (%)0.3%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile1926.95
Q16009
median11030.5
Q319540
95-th percentile42708.05
Maximum971736
Range971736
Interquartile range (IQR)13531

Descriptive statistics

Standard deviation21569.93927
Coefficient of variation (CV)1.340563541
Kurtosis238.7511732
Mean16090.20282
Median Absolute Deviation (MAD)11053.96217
Skewness10.40280683
Sum1609020282
Variance465262280.1
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000000e+00 5.000000e-01 3.050000e+01 6.505000e+02 1.401500e+03 ... 1.389440e+05 2.101015e+05 3.047935e+05 4.405165e+05 9.717360e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 265 0.3%
 
3659 17 < 0.1%
 
8140 16 < 0.1%
 
5051 16 < 0.1%
 
4136 15 < 0.1%
 
4119 15 < 0.1%
 
6219 15 < 0.1%
 
7253 15 < 0.1%
 
6528 14 < 0.1%
 
8202 14 < 0.1%
 
Other values (35992) 99598 99.6%
 
ValueCountFrequency (%) 
0 265 0.3%
 
1 1 < 0.1%
 
2 5 < 0.1%
 
3 4 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
971736 1 < 0.1%
 
959754 1 < 0.1%
 
882984 1 < 0.1%
 
779021 1 < 0.1%
 
757281 1 < 0.1%
 

total_acc
Real number (ℝ≥0)

Distinct count107
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.67591
Minimum2
Maximum176
Zeros0
Zeros (%)0.0%
Memory size781.4 KiB

Quantile statistics

Minimum2
5-th percentile9
Q116
median23
Q331
95-th percentile47
Maximum176
Range174
Interquartile range (IQR)15

Descriptive statistics

Standard deviation11.88383389
Coefficient of variation (CV)0.481596581
Kurtosis2.017509218
Mean24.67591
Median Absolute Deviation (MAD)9.298856599
Skewness0.9837047494
Sum2467591
Variance141.2255079
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2. 3.5 4.5 5.5 6.5 ... 73.5 82.5 90.5 101. 176. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
20 3645 3.6%
 
19 3627 3.6%
 
21 3622 3.6%
 
18 3616 3.6%
 
22 3604 3.6%
 
17 3548 3.5%
 
16 3503 3.5%
 
23 3477 3.5%
 
24 3446 3.4%
 
15 3393 3.4%
 
Other values (97) 64519 64.5%
 
ValueCountFrequency (%) 
2 21 < 0.1%
 
3 82 0.1%
 
4 337 0.3%
 
5 607 0.6%
 
6 890 0.9%
 
ValueCountFrequency (%) 
176 1 < 0.1%
 
173 1 < 0.1%
 
144 1 < 0.1%
 
140 1 < 0.1%
 
117 1 < 0.1%
 

collections_12_mths_ex_med
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.01881
Minimum0
Maximum5
Zeros98290
Zeros (%)98.3%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1503210227
Coefficient of variation (CV)7.991548256
Kurtosis120.9612827
Mean0.01881
Median Absolute Deviation (MAD)0.036976698
Skewness9.594333204
Sum1881
Variance0.02259640986
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 3.5 5. ], "bayesian blocks" binning strategy used)
Common values

Value   Count   Frequency (%)
0       98290   98.3%
1       1568    1.6%
2       119     0.1%
3       18      < 0.1%
4       4       < 0.1%
5       1       < 0.1%
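
Because collections_12_mths_ex_med has only six distinct values, its summary statistics can be recomputed from the frequency table alone. The sketch below does so, assuming the variance uses the sample (n - 1) denominator, which is the pandas default; that choice is an assumption about how the report was produced.

```python
import math

# value -> count, from the frequency table above.
counts = {0: 98290, 1: 1568, 2: 119, 3: 18, 4: 4, 5: 1}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Sample variance (n - 1 denominator), assumed to match the report.
variance = sum(count * (value - mean) ** 2 for value, count in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean

# Mean absolute deviation around the mean (compare with the reported MAD row).
mean_abs_dev = sum(count * abs(value - mean) for value, count in counts.items()) / n

print(n, total, mean, variance, std, cv, mean_abs_dev)
```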
 
ValueCountFrequency (%) 
0 98290 98.3%
 
1 1568 1.6%
 
2 119 0.1%
 
3 18 < 0.1%
 
4 4 < 0.1%
 
ValueCountFrequency (%) 
5 1 < 0.1%
 
4 4 < 0.1%
 
3 18 < 0.1%
 
2 119 0.1%
 
1 1568 1.6%
 

acc_now_delinq
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0058
Minimum0
Maximum6
Zeros99459
Zeros (%)99.5%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.08358486624
Coefficient of variation (CV)14.41118383
Kurtosis661.4656622
Mean0.0058
Median Absolute Deviation (MAD)0.011537244
Skewness19.79346672
Sum580
Variance0.006986429864
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 99459 99.5%
 
1 514 0.5%
 
2 21 < 0.1%
 
3 3 < 0.1%
 
6 1 < 0.1%
 
5 1 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
0 99459 99.5%
 
1 514 0.5%
 
2 21 < 0.1%
 
3 3 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
6 1 < 0.1%
 
5 1 < 0.1%
 
4 1 < 0.1%
 
3 3 < 0.1%
 
2 21 < 0.1%
 

tot_coll_amt
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count3871
Unique (%)3.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean250.58812
Minimum0
Maximum197765
Zeros84157
Zeros (%)84.2%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile810.05
Maximum197765
Range197765
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2042.770881
Coefficient of variation (CV)8.151906328
Kurtosis1942.024635
Mean250.58812
Median Absolute Deviation (MAD)437.5063009
Skewness32.05463979
Sum25058812
Variance4172912.873
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000e+00 5.00000e+00 2.45000e+01 4.95000e+01 5.05000e+01 ... 1.86445e+04 2.67040e+04 4.07870e+04 6.73065e+04 1.97765e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 84157 84.2%
 
50 206 0.2%
 
100 172 0.2%
 
75 144 0.1%
 
150 95 0.1%
 
60 90 0.1%
 
200 88 0.1%
 
80 80 0.1%
 
55 75 0.1%
 
58 69 0.1%
 
Other values (3861) 14824 14.8%
 
ValueCountFrequency (%) 
0 84157 84.2%
 
10 2 < 0.1%
 
11 1 < 0.1%
 
15 1 < 0.1%
 
16 1 < 0.1%
 
ValueCountFrequency (%) 
197765 1 < 0.1%
 
169257 1 < 0.1%
 
129715 1 < 0.1%
 
111475 1 < 0.1%
 
102841 1 < 0.1%
 

tot_cur_bal
Real number (ℝ≥0)

Distinct count80613
Unique (%)80.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean130528.14
Minimum0
Maximum3164353
Zeros17
Zeros (%)< 0.1%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile8135
Q126989
median68023.5
Q3194309.75
95-th percentile414305
Maximum3164353
Range3164353
Interquartile range (IQR)167320.75

Descriptive statistics

Standard deviation150332.6261
Coefficient of variation (CV)1.151725798
Kurtosis16.0468943
Mean130528.14
Median Absolute Deviation (MAD)110425.7909
Skewness2.689505125
Sum1.3052814e+10
Variance2.259989848e+10
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000000e+00 1.0000000e+00 1.4055000e+03 2.4565000e+03 3.1635000e+03 ... 8.8456750e+05 1.0590610e+06 1.3821805e+06 1.7072620e+06 3.1643530e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 17 < 0.1%
 
21560 8 < 0.1%
 
14278 7 < 0.1%
 
6991 7 < 0.1%
 
20579 7 < 0.1%
 
39010 7 < 0.1%
 
24213 7 < 0.1%
 
31764 6 < 0.1%
 
25252 6 < 0.1%
 
22828 6 < 0.1%
 
Other values (80603) 99922 99.9%
 
ValueCountFrequency (%) 
0 17 < 0.1%
 
2 2 < 0.1%
 
4 1 < 0.1%
 
8 1 < 0.1%
 
9 1 < 0.1%
 
ValueCountFrequency (%) 
3164353 1 < 0.1%
 
2809127 1 < 0.1%
 
2655974 1 < 0.1%
 
2641479 1 < 0.1%
 
2519040 1 < 0.1%
 

chargeoff_within_12_mths
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.00963
Minimum0
Maximum7
Zeros99129
Zeros (%)99.1%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1100789911
Coefficient of variation (CV)11.43084019
Kurtosis407.2945699
Mean0.00963
Median Absolute Deviation (MAD)0.0190922454
Skewness15.63846089
Sum963
Variance0.01211738427
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 4.5 7. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 99129 99.1%
 
1 800 0.8%
 
2 59 0.1%
 
3 7 < 0.1%
 
4 3 < 0.1%
 
7 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
0 99129 99.1%
 
1 800 0.8%
 
2 59 0.1%
 
3 7 < 0.1%
 
4 3 < 0.1%
 
ValueCountFrequency (%) 
7 1 < 0.1%
 
5 1 < 0.1%
 
4 3 < 0.1%
 
3 7 < 0.1%
 
2 59 0.1%
 

delinq_amnt
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count333
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.28833
Minimum0
Maximum94521
Zeros99564
Zeros (%)99.6%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum94521
Range94521
Interquartile range (IQR)0

Descriptive statistics

Standard deviation893.3043658
Coefficient of variation (CV)46.31320419
Kurtosis4660.868735
Mean19.28833
Median Absolute Deviation (MAD)38.41182956
Skewness64.48766706
Sum1928833
Variance797992.6899
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 1.0000e+00 4.7500e+01 6.9500e+01 2.1500e+02 5.9100e+02 1.2975e+03 5.2950e+03 2.0990e+04 9.4521e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 99564 99.6%
 
25 9 < 0.1%
 
54 8 < 0.1%
 
65000 7 < 0.1%
 
30 6 < 0.1%
 
69 5 < 0.1%
 
50 5 < 0.1%
 
39 4 < 0.1%
 
52 4 < 0.1%
 
76 4 < 0.1%
 
Other values (323) 384 0.4%
 
ValueCountFrequency (%) 
0 99564 99.6%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
6 1 < 0.1%
 
7 2 < 0.1%
 
ValueCountFrequency (%) 
94521 1 < 0.1%
 
65000 7 < 0.1%
 
60648 1 < 0.1%
 
59427 1 < 0.1%
 
59009 1 < 0.1%
 

tax_liens
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count21
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.06382
Minimum0
Maximum63
Zeros96122
Zeros (%)96.1%
Memory size781.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum63
Range63
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4680269203
Coefficient of variation (CV)7.333546228
Kurtosis3583.176702
Mean0.06382
Median Absolute Deviation (MAD)0.1226901208
Skewness35.83709048
Sum6382
Variance0.2190491981
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 5.5 6.5 10.5 23. 63. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 96122 96.1%
 
1 2655 2.7%
 
2 697 0.7%
 
3 254 0.3%
 
4 122 0.1%
 
5 74 0.1%
 
6 31 < 0.1%
 
7 11 < 0.1%
 
8 7 < 0.1%
 
9 6 < 0.1%
 
Other values (11) 21 < 0.1%
 
ValueCountFrequency (%) 
0 96122 96.1%
 
1 2655 2.7%
 
2 697 0.7%
 
3 254 0.3%
 
4 122 0.1%
 
ValueCountFrequency (%) 
63 1 < 0.1%
 
24 1 < 0.1%
 
22 1 < 0.1%
 
21 1 < 0.1%
 
20 1 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
93460
1
 
6540
ValueCountFrequency (%) 
0 93460 93.5%
 
1 6540 6.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
67376
1
32624
ValueCountFrequency (%) 
0 67376 67.4%
 
1 32624 32.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
91059
1
 
8941
ValueCountFrequency (%) 
0 91059 91.1%
 
1 8941 8.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
91942
1
 
8058
ValueCountFrequency (%) 
0 91942 91.9%
 
1 8058 8.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94052
1
 
5948
ValueCountFrequency (%) 
0 94052 94.1%
 
1 5948 5.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
93929
1
 
6071
ValueCountFrequency (%) 
0 93929 93.9%
 
1 6071 6.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95336
1
 
4664
ValueCountFrequency (%) 
0 95336 95.3%
 
1 4664 4.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95468
1
 
4532
ValueCountFrequency (%) 
0 95468 95.5%
 
1 4532 4.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95431
1
 
4569
ValueCountFrequency (%) 
0 95431 95.4%
 
1 4569 4.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
96143
1
 
3857
ValueCountFrequency (%) 
0 96143 96.1%
 
1 3857 3.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
92445
1
 
7555
ValueCountFrequency (%) 
0 92445 92.4%
 
1 7555 7.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
93359
1
 
6641
ValueCountFrequency (%) 
0 93359 93.4%
 
1 6641 6.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99967
1
 
33
ValueCountFrequency (%) 
0 99967 > 99.9%
 
1 33 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
53388
1
46612
ValueCountFrequency (%) 
0 53388 53.4%
 
1 46612 46.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99993
1
 
7
ValueCountFrequency (%) 
0 99993 > 99.9%
 
1 7 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99995
1
 
5
ValueCountFrequency (%) 
0 99995 > 99.9%
 
1 5 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
88912
1
 
11088
ValueCountFrequency (%) 
0 88912 88.9%
 
1 11088 11.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
57745
1
42255
ValueCountFrequency (%) 
0 57745 57.7%
 
1 42255 42.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
67820
1
32180
ValueCountFrequency (%) 
0 67820 67.8%
 
1 32180 32.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
60980
1
39020
ValueCountFrequency (%) 
0 60980 61.0%
 
1 39020 39.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
71200
1
28800
ValueCountFrequency (%) 
0 71200 71.2%
 
1 28800 28.8%
 

purpose1
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99038
1
 
962
ValueCountFrequency (%) 
0 99038 99.0%
 
1 962 1.0%
 

purpose2
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
77328
1
22672
ValueCountFrequency (%) 
0 77328 77.3%
 
1 22672 22.7%
 

purpose3
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
1
58281
0
41719
ValueCountFrequency (%) 
1 58281 58.3%
 
0 41719 41.7%
 

purpose4
Boolean

CONSTANT
REJECTED
Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 781.4 KiB

Value   Count    Frequency (%)
0       100000   100.0%
 

purpose5
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
93960
1
 
6040
ValueCountFrequency (%) 
0 93960 94.0%
 
1 6040 6.0%
 

purpose6
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99575
1
 
425
ValueCountFrequency (%) 
0 99575 99.6%
 
1 425 0.4%
 

purpose7
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
98017
1
 
1983
ValueCountFrequency (%) 
0 98017 98.0%
 
1 1983 2.0%
 

purpose8
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
98899
1
 
1101
ValueCountFrequency (%) 
0 98899 98.9%
 
1 1101 1.1%
 

purpose9
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99269
1
 
731
ValueCountFrequency (%) 
0 99269 99.3%
 
1 731 0.7%
 

purpose10
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94180
1
 
5820
ValueCountFrequency (%) 
0 94180 94.2%
 
1 5820 5.8%
 

purpose11
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99935
1
 
65
ValueCountFrequency (%) 
0 99935 99.9%
 
1 65 0.1%
 

purpose12
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
98875
1
 
1125
ValueCountFrequency (%) 
0 98875 98.9%
 
1 1125 1.1%
 

purpose13
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99293
1
 
707
ValueCountFrequency (%) 
0 99293 99.3%
 
1 707 0.7%
 

purpose14
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
99912
1
 
88
ValueCountFrequency (%) 
0 99912 99.9%
 
1 88 0.1%
 

initial_list_status1
Boolean

HIGH CORRELATION
Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 781.4 KiB

Value   Count   Frequency (%)
0       56557   56.6%
1       43443   43.4%
 

initial_list_status2
Boolean

HIGH CORRELATION
Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 781.4 KiB

Value   Count   Frequency (%)
1       56557   56.6%
0       43443   43.4%
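
The counts for initial_list_status1 and initial_list_status2 mirror each other (56557 and 43443 swapped), which is what one would expect if the two columns are the two halves of a one-hot encoding. The sketch below assumes exactly that, i.e. that each row has status2 = 1 - status1. The report does not confirm it, but under that assumption the Pearson correlation is -1, which would explain the High Correlation warning.

```python
import math

# Assumption: the columns are exact row-wise complements. The row order is
# arbitrary here and does not affect the correlation.
status1 = [0] * 56_557 + [1] * 43_443
status2 = [1 - value for value in status1]


def pearson(xs, ys):
    """Plain Pearson correlation coefficient for two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


r = pearson(status1, status2)
print(round(r, 6))
```

If the assumption holds, one of the two columns carries no additional information and could be dropped before modelling.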
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
51068
1
48932
ValueCountFrequency (%) 
0 51068 51.1%
 
1 48932 48.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94764
1
 
5236
ValueCountFrequency (%) 
0 94764 94.8%
 
1 5236 5.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
93991
1
 
6009
ValueCountFrequency (%) 
0 93991 94.0%
 
1 6009 6.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95430
1
 
4570
ValueCountFrequency (%) 
0 95430 95.4%
 
1 4570 4.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94925
1
 
5075
ValueCountFrequency (%) 
0 94925 94.9%
 
1 5075 5.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95144
1
 
4856
ValueCountFrequency (%) 
0 95144 95.1%
 
1 4856 4.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94866
1
 
5134
ValueCountFrequency (%) 
0 94866 94.9%
 
1 5134 5.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94657
1
 
5343
ValueCountFrequency (%) 
0 94657 94.7%
 
1 5343 5.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95104
1
 
4896
ValueCountFrequency (%) 
0 95104 95.1%
 
1 4896 4.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
94997
1
 
5003
ValueCountFrequency (%) 
0 94997 95.0%
 
1 5003 5.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
0
95054
1
 
4946
ValueCountFrequency (%) 
0 95054 95.1%
 
1 4946 4.9%
 

funded_amnt
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1355
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13735.31775
Minimum1000
Maximum40000
Zeros0
Zeros (%)0.0%
Memory size781.4 KiB

Quantile statistics

Minimum1000
5-th percentile3075
Q17200
median12000
Q319200
95-th percentile31127.5
Maximum40000
Range39000
Interquartile range (IQR)12000

Descriptive statistics

Standard deviation8464.825314
Coefficient of variation (CV)0.6162817248
Kurtosis0.08352881489
Mean13735.31775
Median Absolute Deviation (MAD)6857.992112
Skewness0.8703433803
Sum1373531775
Variance71653267.6
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1000. 1012.5 1187.5 1212.5 1387.5 ... 35100. 35875. 36112.5 39962.5 40000. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
10000 7489 7.5%
 
15000 5113 5.1%
 
12000 5007 5.0%
 
20000 4866 4.9%
 
5000 4147 4.1%
 
8000 4041 4.0%
 
6000 3689 3.7%
 
35000 3539 3.5%
 
16000 2336 2.3%
 
25000 2281 2.3%
 
Other values (1345) 57492 57.5%
 
ValueCountFrequency (%) 
1000 377 0.4%
 
1025 1 < 0.1%
 
1050 1 < 0.1%
 
1100 10 < 0.1%
 
1125 6 < 0.1%
 
ValueCountFrequency (%) 
40000 236 0.2%
 
39925 2 < 0.1%
 
39775 1 < 0.1%
 
39700 1 < 0.1%
 
39600 2 < 0.1%
 

funded_amnt_inv
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1391
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13729.34107
Minimum800
Maximum40000
Zeros0
Zeros (%)0.0%
Memory size781.4 KiB

Quantile statistics

Minimum800
5-th percentile3050
Q17200
median12000
Q319200
95-th percentile31100
Maximum40000
Range39200
Interquartile range (IQR)12000

Descriptive statistics

Standard deviation8461.694483
Coefficient of variation (CV)0.6163219661
Kurtosis0.08336136761
Mean13729.34107
Median Absolute Deviation (MAD)6855.342693
Skewness0.8704126735
Sum1372934107
Variance71600273.53
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 800. 975. 1012.5 1187.5 1212.5 ... 36112.5 39687.5 39762.5 39987.5 40000. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
10000 7068 7.1%
 
15000 4684 4.7%
 
12000 4658 4.7%
 
20000 4345 4.3%
 
5000 4063 4.1%
 
8000 3872 3.9%
 
6000 3599 3.6%
 
35000 3040 3.0%
 
16000 2150 2.1%
 
25000 2072 2.1%
 
Other values (1381) 60449 60.4%
 
ValueCountFrequency (%) 
800 1 < 0.1%
 
950 1 < 0.1%
 
1000 376 0.4%
 
1025 1 < 0.1%
 
1050 1 < 0.1%
 
ValueCountFrequency (%) 
40000 194 0.2%
 
39975 4 < 0.1%
 
39950 6 < 0.1%
 
39925 3 < 0.1%
 
39875 2 < 0.1%
 

total_rec_late_fee
Real number (ℝ)

ZEROS
Distinct count2752
Unique (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.448885165
Minimum-2e-09
Maximum874.17
Zeros93433
Zeros (%)93.4%
Memory size781.4 KiB

Quantile statistics

Minimum-2e-09
5-th percentile0
Q10
median0
Q30
95-th percentile15
Maximum874.17
Range874.17
Interquartile range (IQR)0

Descriptive statistics

Standard deviation14.89496269
Coefficient of variation (CV)6.082344288
Kurtosis620.5887133
Mean2.448885165
Median Absolute Deviation (MAD)4.576827973
Skewness18.33148632
Sum244888.5165
Variance221.8599136
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-2.00000000e-09 -9.00000000e-10 1.50000000e-10 3.60000000e-09 5.00000190e-03 ... 1.05040000e+02 1.34970000e+02 2.06175001e+02 3.27505000e+02 8.74170000e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 93433 93.4%
 
15 1695 1.7%
 
30 295 0.3%
 
45 111 0.1%
 
60 43 < 0.1%
 
15.00000001 38 < 0.1%
 
16.49 20 < 0.1%
 
75 18 < 0.1%
 
17.09 16 < 0.1%
 
15.00000004 15 < 0.1%
 
Other values (2742) 4316 4.3%
 
ValueCountFrequency (%) 
-2e-09 1 < 0.1%
 
-1.8e-09 1 < 0.1%
 
0 93433 93.4%
 
3e-10 1 < 0.1%
 
5e-10 1 < 0.1%
 
ValueCountFrequency (%) 
874.17 1 < 0.1%
 
819.2 1 < 0.1%
 
759.56 1 < 0.1%
 
731.8400062 1 < 0.1%
 
634.9 1 < 0.1%
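
total_rec_late_fee is the only variable typed as ℝ rather than ℝ≥0, because its minimum is -2e-09. Values such as -2e-09, 3e-10 and 15.00000001 look like floating-point noise around 0 and 15. The sketch below assumes the fees are currency amounts for which two decimals are meaningful, and shows how rounding collapses the noisy values listed in the tables above. Whether rounding is appropriate for the underlying data is a judgment the report itself does not make.

```python
# A few values copied from the frequency tables above.
raw_fees = [-2e-09, -1.8e-09, 0, 3e-10, 5e-10, 15, 15.00000001, 15.00000004, 731.8400062]

# Assumption: two decimals (cents) are the meaningful precision.
# Adding 0.0 turns a rounded -0.0 into a plain 0.0.
cleaned = [round(fee, 2) + 0.0 for fee in raw_fees]
distinct_cleaned = sorted(set(cleaned))

print(distinct_cleaned)
```

After rounding, the nine sample values reduce to three distinct amounts and none of them is negative.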
 

term1
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size781.4 KiB
1
85592
0
 
14408
ValueCountFrequency (%) 
1 85592 85.6%
 
0 14408 14.4%
 

open_acc
Real number (ℝ≥0)

Distinct count62
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.6207
Minimum1
Maximum82
Zeros0
Zeros (%)0.0%
Memory size781.4 KiB

Quantile statistics

Minimum1
5-th percentile5
Q18
median11
Q314
95-th percentile22
Maximum82
Range81
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.458773625
Coefficient of variation (CV)0.4697456801
Kurtosis3.844543526
Mean11.6207
Median Absolute Deviation (MAD)4.15537797
Skewness1.343695637
Sum1162070
Variance29.79820949
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 36.5 40.5 45.5 53.5 82. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
9 8985 9.0%
 
10 8718 8.7%
 
8 8439 8.4%
 
11 7978 8.0%
 
7 7718 7.7%
 
12 7317 7.3%
 
6 6336 6.3%
 
13 6056 6.1%
 
14 5409 5.4%
 
5 4598 4.6%
 
Other values (52) 28446 28.4%
 
Value Count Frequency (%)
1 19 < 0.1%
 
2 298 0.3%
 
3 1100 1.1%
 
4 2666 2.7%
 
5 4598 4.6%
 
Value Count Frequency (%)
82 1 < 0.1%
 
76 1 < 0.1%
 
72 1 < 0.1%
 
67 1 < 0.1%
 
66 1 < 0.1%
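Several of the open_acc figures above are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from values copied out of the report; the dictionary keys and the function name are illustrative, and the observation count of 100000 is taken from the report overview.

```python
# Summary values copied from the open_acc tables in this report.
# Key names are illustrative, not part of pandas-profiling.
open_acc = {
    "n": 100_000,          # number of observations (report overview)
    "minimum": 1,
    "maximum": 82,
    "q1": 8,
    "q3": 14,
    "mean": 11.6207,
    "std": 5.458773625,
    "sum": 1_162_070,
}


def derived_statistics(s):
    """Recompute the statistics that follow directly from other summary values."""
    return {
        "range": s["maximum"] - s["minimum"],
        "iqr": s["q3"] - s["q1"],
        "cv": s["std"] / s["mean"],
        "variance": s["std"] ** 2,
        "mean": s["sum"] / s["n"],
    }


derived = derived_statistics(open_acc)
for name, value in derived.items():
    print(f"{name}: {value:.10g}")
```

The recomputed values agree with the reported range, IQR, CV, variance, and mean up to the rounding of the printed figures.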
 

installment
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 25708
Unique (%) 25.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 434.0776476
Minimum 23.36
Maximum 1584.9
Zeros 0
Zeros (%) 0.0%
Memory size 781.4 KiB

Quantile statistics

Minimum 23.36
5-th percentile 107.82
Q1 240.2925
Median 366.37
Q3 575.86
95-th percentile 982.021
Maximum 1584.9
Range 1561.54
Interquartile range (IQR) 335.5675

Descriptive statistics

Standard deviation 265.9217457
Coefficient of variation (CV) 0.6126133127
Kurtosis 0.8319156207
Mean 434.0776476
Median Absolute Deviation (MAD) 209.962902
Skewness 1.06302314
Sum 43407764.76
Variance 70714.37482
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 23.36 30.68 32.94 32.99 34.175 ... 1342.89 1343.83 1401.05 1501.96 1584.9 ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
327.34 244 0.2%
 
301.15 226 0.2%
 
318.79 209 0.2%
 
329.72 203 0.2%
 
312.86 199 0.2%
 
332.1 194 0.2%
 
392.81 192 0.2%
 
336.9 192 0.2%
 
491.01 189 0.2%
 
654.68 188 0.2%
 
Other values (25698) 97964 98.0%
 
Value Count Frequency (%)
23.36 1 < 0.1%
 
25.86 1 < 0.1%
 
28.82 1 < 0.1%
 
29 1 < 0.1%
 
29.52 1 < 0.1%
 
Value Count Frequency (%)
1584.9 1 < 0.1%
 
1569.11 1 < 0.1%
 
1517.09 1 < 0.1%
 
1503.89 1 < 0.1%
 
1500.03 1 < 0.1%
 

revol_util
Real number (ℝ≥0)

Distinct count 1101
Unique (%) 1.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 0.53723294
Minimum 0
Maximum 8.923
Zeros 312
Zeros (%) 0.3%
Memory size 781.4 KiB

Quantile statistics

Minimum 0
5-th percentile 0.134
Q1 0.361
Median 0.541
Q3 0.72
95-th percentile 0.92
Maximum 8.923
Range 8.923
Interquartile range (IQR) 0.359

Descriptive statistics

Standard deviation 0.2393730716
Coefficient of variation (CV) 0.4455666318
Kurtosis 14.27000246
Mean 0.53723294
Median Absolute Deviation (MAD) 0.1977664045
Skewness 0.3333455311
Sum 53723.294
Variance 0.05729946739
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-04 5.5500e-02 1.0950e-01 1.3650e-01 ... 1.0370e+00 1.0900e+00 1.2125e+00 1.6235e+00 8.9230e+00], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
0 312 0.3%
 
0.48 232 0.2%
 
0.57 227 0.2%
 
0.44 221 0.2%
 
0.49 218 0.2%
 
0.54 217 0.2%
 
0.61 216 0.2%
 
0.58 214 0.2%
 
0.6 213 0.2%
 
0.51 206 0.2%
 
Other values (1091) 97724 97.7%
 
Value Count Frequency (%)
0 312 0.3%
 
0.001 44 < 0.1%
 
0.002 36 < 0.1%
 
0.003 32 < 0.1%
 
0.004 34 < 0.1%
 
Value Count Frequency (%)
8.923 1 < 0.1%
 
1.72 1 < 0.1%
 
1.527 1 < 0.1%
 
1.48 1 < 0.1%
 
1.399 1 < 0.1%
 

out_prncp
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS
Distinct count 28
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 0.2533273
Minimum 0
Maximum 2330.97
Zeros 99973
Zeros (%) > 99.9%
Memory size 781.4 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 2330.97
Range 2330.97
Interquartile range (IQR) 0

Descriptive statistics

Standard deviation 18.05329008
Coefficient of variation (CV) 71.26468437
Kurtosis 8228.815507
Mean 0.2533273
Median Absolute Deviation (MAD) 0.5065178033
Skewness 85.44453918
Sum 25332.73
Variance 325.9212826
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 21.63 2330.97], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
0 99973 > 99.9%
 
508 1 < 0.1%
 
471.7 1 < 0.1%
 
1796.48 1 < 0.1%
 
136.24 1 < 0.1%
 
687.93 1 < 0.1%
 
975.05 1 < 0.1%
 
1527.69 1 < 0.1%
 
949.15 1 < 0.1%
 
1345.74 1 < 0.1%
 
Other values (18) 18 < 0.1%
 
Value Count Frequency (%)
0 99973 > 99.9%
 
43.26 1 < 0.1%
 
136.24 1 < 0.1%
 
216.74 1 < 0.1%
 
283.37 1 < 0.1%
 
Value Count Frequency (%)
2330.97 1 < 0.1%
 
2081.26 1 < 0.1%
 
1796.48 1 < 0.1%
 
1628.56 1 < 0.1%
 
1527.69 1 < 0.1%
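The MAD reported for out_prncp (0.5065178033) is nonzero even though the median is 0 and more than 99.9% of the values are 0. A median absolute deviation would be exactly 0 in that situation. The sketch below tests one possible explanation: that the figure is the mean absolute deviation around the mean. This is an assumption about how the report computed it, not something the report states. Because every nonzero value (smallest: 43.26) lies above the mean, the deviations can be summed from the summary values alone; the function and argument names are illustrative.

```python
def mean_abs_deviation_from_summary(n_total, n_zeros, total_sum):
    """Mean absolute deviation around the mean for a column that contains
    n_zeros zeros and whose remaining values all exceed the mean.

    Assumption (labelled, not confirmed by the report): "MAD" here is the
    mean of |x - mean|, not the median of |x - median|.
    """
    mean = total_sum / n_total
    n_nonzero = n_total - n_zeros
    # Each zero is `mean` below the mean.
    below = n_zeros * mean
    # Each nonzero value x is (x - mean) above the mean; summed over all of them.
    above = total_sum - n_nonzero * mean
    return (below + above) / n_total


# Values copied from the out_prncp summary in this report.
mad = mean_abs_deviation_from_summary(
    n_total=100_000, n_zeros=99_973, total_sum=25332.73
)
print(f"{mad:.10f}")
```

Under that assumption the result matches the reported figure to the printed precision, which suggests the MAD rows in this report are best read as mean absolute deviations.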
 

out_prncp_inv
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS
Distinct count 28
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 0.2532594
Minimum 0
Maximum 2330.97
Zeros 99973
Zeros (%) > 99.9%
Memory size 781.4 KiB

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 2330.97
Range 2330.97
Interquartile range (IQR) 0

Descriptive statistics

Standard deviation 18.05174595
Coefficient of variation (CV) 71.27769375
Kurtosis 8231.340971
Mean 0.2532594
Median Absolute Deviation (MAD) 0.5063820399
Skewness 85.45945131
Sum 25325.94
Variance 325.8655319
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 21.63 2330.97], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
0 99973 > 99.9%
 
508 1 < 0.1%
 
686.21 1 < 0.1%
 
1796.48 1 < 0.1%
 
136.24 1 < 0.1%
 
975.05 1 < 0.1%
 
1527.69 1 < 0.1%
 
949.15 1 < 0.1%
 
470.13 1 < 0.1%
 
1345.74 1 < 0.1%
 
Other values (18) 18 < 0.1%
 
Value Count Frequency (%)
0 99973 > 99.9%
 
43.26 1 < 0.1%
 
136.24 1 < 0.1%
 
214.93 1 < 0.1%
 
281.68 1 < 0.1%
 
Value Count Frequency (%)
2330.97 1 < 0.1%
 
2081.26 1 < 0.1%
 
1796.48 1 < 0.1%
 
1628.56 1 < 0.1%
 
1527.69 1 < 0.1%
 

total_rec_int
Real number (ℝ≥0)

Distinct count 85095
Unique (%) 85.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 2491.282802
Minimum 0
Maximum 28005.96
Zeros 170
Zeros (%) 0.2%
Memory size 781.4 KiB

Quantile statistics

Minimum 0
5-th percentile 263.4885
Q1 857.2925
Median 1615.16
Q3 3039.115
95-th percentile 7904.314
Maximum 28005.96
Range 28005.96
Interquartile range (IQR) 2181.8225

Descriptive statistics

Standard deviation 2706.2622
Coefficient of variation (CV) 1.086292651
Kurtosis 10.99100512
Mean 2491.282802
Median Absolute Deviation (MAD) 1809.982536
Skewness 2.788551541
Sum 249128280.2
Variance 7323855.092
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-03 7.9450000e+00 8.0210000e+01 1.5729500e+02 ... 1.7179100e+04 1.7970380e+04 2.1354240e+04 2.5706025e+04 2.8005960e+04], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
0 170 0.2%
 
1257.53 34 < 0.1%
 
1258.44 27 < 0.1%
 
838.35 24 < 0.1%
 
1254.05 22 < 0.1%
 
1676.72 22 < 0.1%
 
1670.81 22 < 0.1%
 
1431.12 21 < 0.1%
 
2128.02 20 < 0.1%
 
2347.4 20 < 0.1%
 
Other values (85085) 99618 99.6%
 
Value Count Frequency (%)
0 170 0.2%
 
0.01 1 < 0.1%
 
0.6 1 < 0.1%
 
0.62 1 < 0.1%
 
0.71 1 < 0.1%
 
Value Count Frequency (%)
28005.96 1 < 0.1%
 
27884.8 1 < 0.1%
 
27862.51 1 < 0.1%
 
27663.62 1 < 0.1%
 
26952.39 1 < 0.1%
 

fico_range_low
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 38
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 692.63055
Minimum 660
Maximum 845
Zeros 0
Zeros (%) 0.0%
Memory size 781.4 KiB

Quantile statistics

Minimum 660
5-th percentile 660
Q1 670
Median 685
Q3 705
95-th percentile 755
Maximum 845
Range 185
Interquartile range (IQR) 35

Descriptive statistics

Standard deviation 29.66801744
Coefficient of variation (CV) 0.04283382741
Kurtosis 2.343301129
Mean 692.63055
Median Absolute Deviation (MAD) 22.79368871
Skewness 1.419130191
Sum 69263055
Variance 880.1912586
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[660. 662.5 672.5 682.5 692.5 ... 797.5 807.5 817.5 827.5 845. ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
660 10315 10.3%
 
665 9594 9.6%
 
670 9531 9.5%
 
675 8396 8.4%
 
680 8280 8.3%
 
685 6834 6.8%
 
690 6678 6.7%
 
695 5960 6.0%
 
700 5234 5.2%
 
705 4785 4.8%
 
Other values (28) 24393 24.4%
 
Value Count Frequency (%)
660 10315 10.3%
 
665 9594 9.6%
 
670 9531 9.5%
 
675 8396 8.4%
 
680 8280 8.3%
 
Value Count Frequency (%)
845 11 < 0.1%
 
840 20 < 0.1%
 
835 24 < 0.1%
 
830 31 < 0.1%
 
825 60 0.1%
 

fico_range_high
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count 38
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 696.63066
Minimum 664
Maximum 850
Zeros 0
Zeros (%) 0.0%
Memory size 781.4 KiB

Quantile statistics

Minimum 664
5-th percentile 664
Q1 674
Median 689
Q3 709
95-th percentile 759
Maximum 850
Range 186
Interquartile range (IQR) 35

Descriptive statistics

Standard deviation 29.66858423
Coefficient of variation (CV) 0.0425886857
Kurtosis 2.344900675
Mean 696.63066
Median Absolute Deviation (MAD) 22.79381989
Skewness 1.419333046
Sum 69663066
Variance 880.2248902
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[664. 666.5 676.5 686.5 696.5 ... 801.5 811.5 821.5 831.5 850. ], "bayesian blocks" binning strategy used)
Value Count Frequency (%)
664 10315 10.3%
 
669 9594 9.6%
 
674 9531 9.5%
 
679 8396 8.4%
 
684 8280 8.3%
 
689 6834 6.8%
 
694 6678 6.7%
 
699 5960 6.0%
 
704 5234 5.2%
 
709 4785 4.8%
 
Other values (28) 24393 24.4%
 
Value Count Frequency (%)
664 10315 10.3%
 
669 9594 9.6%
 
674 9531 9.5%
 
679 8396 8.4%
 
684 8280 8.3%
 
Value Count Frequency (%)
850 11 < 0.1%
 
844 20 < 0.1%
 
839 24 < 0.1%
 
834 31 < 0.1%
 
829 60 0.1%
 

depvar
Boolean

Distinct count 2
Unique (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 781.4 KiB
Value Count Frequency (%)
0 67431 67.4%
 
1 32569 32.6%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only the direction of the slope determines its sign.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
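As a minimal sketch of that calculation (standard library only; the function name is illustrative and this is not the code the report itself runs), r can be computed as the sample covariance divided by the product of the sample standard deviations:

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their
    standard deviations (the n - 1 factors cancel, but are kept for clarity)."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    if std_x == 0 or std_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / (std_x * std_y)


x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
print(pearson_r(x, y))
# Shifting and positively rescaling either variable leaves r unchanged.
print(pearson_r([10 * a + 3 for a in x], [0.5 * b - 2 for b in y]))
```

Both calls print the same value up to floating-point rounding, which illustrates the location and scale invariance described above.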

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
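The pair-counting definition above can be sketched directly. This is the plain form often called τ-a, written as a quadratic-time loop for clarity; the function name is illustrative, tied pairs count as neither concordant nor discordant, and libraries commonly report a tie-corrected variant (τ-b), so values may differ on tied data.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables order the pair the same way
        elif direction < 0:
            discordant += 1      # the variables order the pair oppositely
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Six pairs in total: five concordant, one discordant (the swapped 3 and 2).
print(kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4]))
```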

Missing values

Sample

First rows

int_rate annual_inc dti delinq_2yrs inq_last_6mths pub_rec revol_bal total_acc collections_12_mths_ex_med acc_now_delinq tot_coll_amt tot_cur_bal chargeoff_within_12_mths delinq_amnt tax_liens emp_length1 emp_length2 emp_length3 emp_length4 emp_length5 emp_length6 emp_length7 emp_length8 emp_length9 emp_length10 emp_length11 emp_length12 home_ownership1 home_ownership2 home_ownership3 home_ownership4 home_ownership5 home_ownership6 verification_status1 verification_status2 verification_status3 purpose1 purpose2 purpose3 purpose4 purpose5 purpose6 purpose7 purpose8 purpose9 purpose10 purpose11 purpose12 purpose13 purpose14 initial_list_status1 initial_list_status2 mths_since_last_delinq1 mths_since_last_delinq2 mths_since_last_delinq3 mths_since_last_delinq4 mths_since_last_delinq5 mths_since_last_delinq6 mths_since_last_delinq7 mths_since_last_delinq8 mths_since_last_delinq9 mths_since_last_delinq10 mths_since_last_delinq11 funded_amnt funded_amnt_inv total_rec_late_fee term1 open_acc installment revol_util out_prncp out_prncp_inv total_rec_int fico_range_low fico_range_high depvar
00.082421000.029.190103016260001177300010000000000000000101001000000000000011000000000012001200.00.011837.740.0760.00.0157.947657690
10.129980000.04.820115722240002187500001000000000000000100100100000000000011000000000080008000.00.018269.520.4470.00.01702.426656690
20.129938000.023.660306511180003186800000010000000000000101000100000000000010000000000150005000.00.017168.450.8800.00.01066.646706740
30.1367100000.016.274206849300003260490000010000000000100000010010000000000001000010000001500015000.00.0112510.270.4570.00.01256.246806841
40.126930000.025.2801281971200250688400000001000000000000010100010000000000001100000000001000010000.00.018335.450.4160.00.0871.046606641
50.131890000.03.487002903310001954800000000000001000000101001000000000000010010000000070007000.00.017236.470.8540.00.01330.306606640
60.079160000.017.3400020399350001173200000000000000010100001000010000000000001000000001002000020000.00.0118625.900.3690.00.02567.447507540
70.097579600.015.97301648117000187570010100000000000000010010010000000000010000100000002380023800.00.017765.170.7450.00.03006.766806840
80.0789150000.07.361001582738005433675040000010000000000100001000010000000000001001000000001600016000.00.0119500.580.3340.00.01286.386656690
90.099995000.025.78010178703700027377600000000000001000000110000100000000000011000000000080008000.00.0124258.100.7640.00.0796.596656690

Last rows

int_rate annual_inc dti delinq_2yrs inq_last_6mths pub_rec revol_bal total_acc collections_12_mths_ex_med acc_now_delinq tot_coll_amt tot_cur_bal chargeoff_within_12_mths delinq_amnt tax_liens emp_length1 emp_length2 emp_length3 emp_length4 emp_length5 emp_length6 emp_length7 emp_length8 emp_length9 emp_length10 emp_length11 emp_length12 home_ownership1 home_ownership2 home_ownership3 home_ownership4 home_ownership5 home_ownership6 verification_status1 verification_status2 verification_status3 purpose1 purpose2 purpose3 purpose4 purpose5 purpose6 purpose7 purpose8 purpose9 purpose10 purpose11 purpose12 purpose13 purpose14 initial_list_status1 initial_list_status2 mths_since_last_delinq1 mths_since_last_delinq2 mths_since_last_delinq3 mths_since_last_delinq4 mths_since_last_delinq5 mths_since_last_delinq6 mths_since_last_delinq7 mths_since_last_delinq8 mths_since_last_delinq9 mths_since_last_delinq10 mths_since_last_delinq11 funded_amnt funded_amnt_inv total_rec_late_fee term1 open_acc installment revol_util out_prncp out_prncp_inv total_rec_int fico_range_low fico_range_high depvar
999900.1446160000.02.48021480215000117500000000010000000000011000000000001000001100000000001200012000.00.0110412.820.1610.00.0234.696756790
999910.099979000.022.120106896280001277740000000000000010100000100010000000000010100000000001500014900.00.019483.940.3750.00.02412.167457490
999920.081858000.016.1200015562440023363669600001000000000000000110001000000000000100000000010080007900.00.0117251.360.5810.00.01035.596606640
999930.114473000.012.300002155126000215510000100000000000000010100100000000000001100000000002200022000.00.017724.850.8420.00.04118.687057090
999940.156188000.019.3801026976260003247470000000000010000100000100010000000000010100000000002130021300.00.0118744.760.8560.00.05575.436856890
999950.175765000.017.670311125521100265700000000001000000000010010010000000000001100000000002000020000.00.0113718.750.7800.00.05373.296606641
999960.089065000.02.88000210512000613800000000000001000000101000100000000000101000000000060006000.00.017190.520.1200.00.0835.667657690
999970.134946000.032.120108998200009653100000010000000000000110001000000000000011000000000064006400.00.0119217.160.6430.00.01261.676656690
999980.211531000.04.5301038754000387500000000000000100000100100000000010000011000000000055005500.00.013207.640.7310.00.01357.697107141
999990.1599125000.033.3300034580300004226260000000100000000100000010010000000000001100000000003312533125.00.01191164.420.4990.00.08882.586906940